# Large-scale Dataset
Vit Base Patch32 Clip 224.metaclip 400m
A vision-language model trained on the MetaCLIP-400M dataset, supporting zero-shot image classification.
Image Classification
timm
2,406
0
Sambalingo Arabic Chat
A chat model from SambaNova Systems built on Llama 2, Meta's openly released large language model, supporting English and Arabic.
Large Language Model
Transformers, Supports Multiple Languages

sambanovasystems
362
63
Openclip Resnet50 CC12M
MIT
An OpenCLIP model based on the ResNet50 architecture and trained on the CC12M dataset, supporting zero-shot image classification.
Image Classification
thaottn
13.67k
0
Languagebind Video V1.5 FT
MIT
LanguageBind is a language-centric multimodal pretraining method that uses language as the bond between different modalities to achieve multimodal semantic alignment.
Multimodal Alignment
Transformers

LanguageBind
853
5
Languagebind Video FT
MIT
LanguageBind is a language-centric multimodal pretraining method that uses language as the bond between different modalities to achieve semantic alignment across video, infrared, depth, audio, and other modalities.
Multimodal Alignment
Transformers

LanguageBind
22.97k
4
Eva02 Large Patch14 Clip 336.merged2b S6b B61k
MIT
EVA02 is a large-scale vision-language model based on the CLIP architecture, supporting zero-shot image classification tasks.
Text-to-Image
timm
15.78k
0
Wavlm Base Plus
WavLM is a large-scale self-supervised speech model developed by Microsoft, pretrained on speech audio sampled at 16 kHz and suitable for a range of speech processing tasks.
Speech Recognition
Transformers, English

microsoft
673.32k
31
Vit Base Patch16 224 In21k
Apache-2.0
A Vision Transformer model pretrained on the ImageNet-21k dataset for image classification tasks.
Image Classification
google
2.2M
323
All Datasets V3 Mpnet Base
Apache-2.0
A sentence embedding model based on the MPNet architecture that maps text to a 768-dimensional vector space, suitable for semantic search and sentence similarity calculation.
Text Embedding, English
flax-sentence-embeddings
3,472
13